Perceptual evaluation of blind source separation for robust speech recognition

نویسندگان

Leandro E. Di Persia

Diego H. Milone

Hugo Leonardo Rufiner

Masuzo Yanagida

چکیده

In a previous article, an evaluation of several objective quality measures as predictors of recognition rate after application of a blind source separation algorithm was reported. In this work, the experiments were repeated using some new measures, based on the perceptual evaluation of speech quality (PESQ), which is part of the ITU P862 standard for evaluation of communication systems. The raw PESQ and a nonlinearly transformed PESQ were evaluated, together with several composite measures. The results show that the PESQ-based measures outperformed all the measures reported in the previous work. Based on these results, we recommend the use of PESQ-based measures to evaluate blind source separation algorithms for automatic speech recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Real-Time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition

This demo presents a real-time prototype for automatic blind source extraction and speech recognition in presence of multiple interfering noise sources. Binaural recorded mixtures are processed by a combined Blind/Semi-Blind Source Separation algorithm in order to obtain an estimation of the target signal. The recovered target signal is segmented and used as input to a real-time automatic speec...

متن کامل

Evaluation of missing data techniques for in-car automatic speech recognition

One of the major concerns in deploying speech recognition applications is the lack of robustness of the technology. One key aspect is the sensitivity to stationary or non-stationary background noise. Many approaches to noise robust speech recognition have been proposed before. Some modify the front-end signal processing of the recogniser while others work on the back-end, i.e. modelling and dec...

متن کامل

A spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments

A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial pro...

متن کامل

A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments

An acoustic front-end for robust automatic speech recognition in noisy and reverberant environments is proposed in this contribution. It comprises a blind source separation-based signal extraction scheme and only requires two microphone signals. The proposed front-end and its integration into the recognition system is analyzed and evaluated in noisy living room-like environments according to th...

متن کامل

Spatio-temporal Speech Enhancement for Robust Speech Recognition

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Signal Processing

دوره 88 شماره

صفحات -

تاریخ انتشار 2008

Perceptual evaluation of blind source separation for robust speech recognition

نویسندگان

چکیده

منابع مشابه

Real-Time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition

Evaluation of missing data techniques for in-car automatic speech recognition

A spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments

A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments

Spatio-temporal Speech Enhancement for Robust Speech Recognition

عنوان ژورنال:

اشتراک گذاری